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We measure the dependence of B"-^D*^ e^Ve on the decay angles and momentum transfer. The 
data sample consists of ~ 86 x 10® _B_B-pairs accumulated on the T(4S) resonance by the BABAR 
detector at the asymmetric e'^e~ collider PEP-II. We specify the three form factors by two ratios _Ri 
and R2, and by a single parameter characterizing the polynomial representing /iai, the function 
which describes the momentum-transfer dependence of the form factor A\. We determine Ri, R2, 
and using an unbinned maximum likelihood fit to the full decay distribution. The results are Ri = 
1.396±0.060±0.035±0.027, R2 = 0.885±0.040±0.022±0.013, and = 1.145±0.059±0.030±0.035. 
The stated uncertainties are the statistical from the data, statistical from the size of the Monte Carlo 
sample and the systematic uncertainty, respectively. In addition, based on this measurement, we 
give an updated value for the CKM matrix element |Vc6|. 
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PACS numbers: 13.25.Hw, 12.15.Hh, ll.30.Er 

I. INTRODUCTION 

The hadronic weak current of the exclusive semilep- 
tonic decay B^D*lv with a light lepton £ = e or n can 
be described by two axial form factors Ai and A2, and 
one vector form factor V, which are functions of the B- 
to-D* momentum transfer squared, (and though here 
we present results only for the B^^D*^e~Ve decay, the 
results are expected to be the same with the electron re- 
placed by a muon, and thus apply to all B^ D*~^£~i7 
decays where £ is a light charged lepton). These form 
factors are usually characterized in terms of their ratio 
parameters Ri and i?2, and a slope parameter p^. These 
terms will be defined precisely in Sec. II. First measure- 
ments of these three parameters were made by the CLEO 
collaboration [1]. Improved measurements are important 
for a significant reduction of the experimental error in 
the determination of the Cabibbo-Kobayashi-Maskawa 
(CKM) matrix element \Vcb\- They also provide for im- 
proved determinations of \Vub\ from inclusive and exclu- 
sive semileptonic b — s- uiv decays by improving the ac- 
curacy with which the dominant b — > civ background to 
these decays is known. 

In heavy-quark effective field theory (HQET) [2, 3], 
the three form factors are related to each other through 
heavy quark symmetry (HQS). HQET allows for three 
free parameters, which must be determined by exper- 
iment. Deviations from the HQS relationships can be 
computed as corrections to the theory. They can also be 
measured, and the parameterization we adopt is inspired 
by HQET, but allows for such deviations to be extracted 
and compared against HQET predictions, as we will see 
in Sec. X. 

The data used in this analysis were recorded by 
the BABAR detector at the PEP-II storage ring, and 
correspond to 79 fb~^ integrated on the T(4S) res- 
onance, yielding 86 x 10^ i3i?-pairs. There are 
~ 5 X 10^ B^ — !■ 0*61^ decays in this sample, of which 
we have reconstructed 16,386 candidates for the decay 
B°->D*+e-De using only the D*~>D"'k+ decay of the 
D* and the D^^K~tt^ decay of the (the above also 
includes reconstruction of the corresponding charge con- 
jugate decay chain, and generally charge conjugation is 
implied everywhere throughout this document). 

We introduce here a novel method of extracting form- 



*Also with the Johns Hopkins University, Baltimore, Maryland 
21218 , USA 

tAlso at Laboratoire de Physique Corpusculaire, Clermont- 
Ferrand, France 

•tAlso with Universita di Perugia, Dipartimcnto di Fisica, Perugia, 
Italy 

''Also with Universita dcUa Basilicata, Potenza, Italy 
^Deceased 



factor parameters from the data. We use an unbinned 
maximum likelihood method, and introduce approxima- 
tions that allow us to correct for the efficiency and res- 
olution with the limited Monte Carlo (MC) data sample 
available. The impact of these approximations on our 
results is studied in detail. 

An important difference from the earlier analysis [1] 
is that in place of a linear parameterization of the q^- 
dependence of the form factors , used for their main 
results, we use a higher-order polynomial motivated by 
theory. The linear form was assumed in similar analyses 
(in which only iVcbl and the slope form factor were ex- 
tracted, but not all three form factors) by ARGUS [4], 
operating at the T(4S), and by ALEPH [5, 6] , DEL- 
PHI [7], and OPAL [8] at LEP. The need for higher-order 
terms has been indicated by recent data [9]. Further, 
this extension is also required by theoretical constraints 
[10], [11], [12]. The inclusion or exclusion of higher-order 
terms leads to very different values for the parameter 
(which is defined explicitly in Eq. (10) ), as will be dis- 
cussed further in Sec. X. In order to compare with pre- 
vious results, we also report the result of a fit obtained 
with a linear parameterization. 

An outline of the paper is as follows: In Sec. II, we 
define the observables, form factors, decay amplitudes, 
and parameters. Sec. Ill describes the relevant aspects 
of the detector that are most significant for this analy- 
sis. Event reconstruction and selection is described in 
Sec. IV, the Monte Carlo simulation in Sec. V, and the 
analysis method in Sec. VI. Sec. VII describes fit re- 
sults. Sec. VIII the goodness-of-fit method and tests, and 
Sec. IX the estimation of systematic errors. In Sec. X, 
we summarize the results and compare them to the pre- 
vious measurements. Appendix A covers technical de- 
tails of the pseudo-likelihood systematic error estimation 
method, and Appendix B gives more detail on the deter- 
mination of the levels of combinatoric and peaking back- 
grounds. 



II. FORMALISM 

This section outlines the formalism and describes the 
parameterization used for the form factors. More details 
can be found in Refs. [2, 3]. 

The lowest order quark-level diagram for the decay 
B^ D*~^£~17 is shown in Fig. 1. 
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A. Kinematic Variables 

A B-^D*£iy decay is completely characterized by four 
variables, three angles and q^, the square of the momen- 
tum transfer from the B to the D* meson. 

The momentum transfer is linearly related to another 
Lorentz invariant variable, called w, by 



Pb ■ PD' 



Ml 



MbMd' 2MbMd^ ' 

where Mb and Mb,* are the masses of the B and the 
D* mesons, pb and po* are their four-momenta, and vb 
and Vb* are their four-velocities. In the B rest frame the 
expression for w reduces to the Lorentz boost to the D* 
, 7_D* =Ed'/Md'. 

The ranges of w and are restricted by the kinematics 
of the decay, with q^ corresponding to 



Ml 



Ml, 



1.504 



and 



9r, 



2MbMd' 
1 corresponding to 

= {Mb - Mo'f w 10.69(GeV/c2)^ 



(2) 



(3) 



In this analysis we only reconstruct the decay 
£)*+^I?°7r+, where D^^K^tt^ . The angular variables, 
shown in Fig. 2, are 

• 9 1, the angle between the direction of the lepton 
(i.e.; for this analysis, the electron) in the virtual 
W rest frame, and the direction of the virtual W 
in the B rest frame 

• Ov , the angle between the direction of the D in the 
D* rest frame, and the direction of the D* in the 
B rest frame 

• Xi the dihedral angle between the plane formed by 
the D* — D and the plane formed by the W — i 
system. 



B. Four-Dimensional Decay Distribution 

The Lorentz structure of the B^D*£v decay ampli- 
tude can be expressed in terms of three helicity ampli- 
tudes and Ht^), which correspond to the three 



FIG. 2: Kinematics of a B^D*lu decay, mediated by a weak 
virtual intermediate vector boson W . This diagram defines 
the three kinematic variable angles 6i, Oy and x- 



polarization states of the D* (two transverse and one 
longitudinal). For light leptons these amplitudes are ex- 
pressed [2, 3] in terms of the three form factors: 



-{Mb + Md.)A,{w) + 2 „f^;^ff V{w) 



H+{w) = 
H_{w) = -{Mb + MdMi{w) 
Ho{w) = 



Mb + Mb- 
, Pd'Mb 
'Mb + M 



D' 



V{w) 



Mb + Md' 



MB'Jq^ 



{A^{w){MbAwMb - Mb-)) 



{Mb+Md>)' 



r^2(H) 



(4) 



where pB* 



1 is the magnitude of the mo- 
mentum of the D* in the B rest frame (for the r lepton, 
which has non-negligible mass compared to the other par- 
ticles in this process, a fourth helicity amplitude, iJj, 
would contribute [2] ). 

The full differential decay rate in terms of the three 
helicity amplitudes is [2, 3] 



dV 



iGl\V,b\^PD'q^ 



dq^ dcos 9i dcos 9v dx 



^Hll 



8(47r) 

[i7+(l -cos6l^)^sin^ ev+ 
i7i(i + cos 6*^)2 sin^ + 



ml sin^ Oi cos^ 9v 
-2H+H_ sin^ 0^ sin^ 0v cos 2x 
— 4iJ+i7osin0f(l — cos6'f)sin0v'cos6'y cosx 
+4:H^Hosm9e{l + cos9e)smdvcos9v cosx] , 

(5) 

where Sd-d = BF{D*+^D'^Tr+)x BF{D°^K-7r+) and 
where all three of the Hi are functions of w. The four- 
dimensional distribution of w, cosO^, cosOy, and x de- 
scribed by Eq.(5) is the physical observable from which 
we extract the form factors. 
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In this analysis we only deal with the shape and rela- 
tive normalization of the form factors, and consequently 
the overall normalization of the rate is irrelevant. 



C. Heavy Quark Symmetry Relations 

The functions Ri^w) and R2{w) are defined in terms 
of the axial and vector form factors by 



A2{w) 



_ R2{w) 2 



R 



*2 



1 



Aliw) 



and 



, Ri(w) 2 ... 
R*'^ w + 1 

where the constant R* is defined by 



R* 



2^MbMd' 
(Mb + Md-)' 



(6) 



(7) 



(8) 



Perfect HQS impHes that Ri{w) = R2{w) = 1, i.e., 
the form factors A2 and V are identical for all values of 
w and differ from Ai only by a simple kinematic factor. 
Since HQS is not exact, in general i?i and R2 will differ 
from unity and exhibit some w-dependence as discussed 
in Sec. II D. 

It is conventional to introduce 



1 



R*w + 1 



Aliw), 



(9) 



such that in the HQS limit, /i^^ is the Isgur-Wise function 
[13] and hA,{l)=^{l) = 1. 
The function can be parameterized in a number of 
ways. One simple empirical expansion in w — 1 is 



h-Aiiw) 
(I 



p'^{w - 1) + X{w 



(10) 



1)^ 



1)^ 



We will not fit with this general expansion as there are 
good theoretical constraints (see Eq. (11)), which relate 
all the higher-order parameters to the linear param- 
eter. Further, convergence issues arise with attempts to 
fit a free functional form with so many parameters (see 
Sec. VHB). 

Corrections to HQS modify hAi{l) and thus lead to 
deviations from the relation of /iai(1) = 1. In the base- 
line analysis we use a parameterization of Hai which con- 
forms to requirements of analyticity. This was first devel- 
oped and presented by Boyd, Grinstein, and Lebed [11], 
though the form used here employs a version of this pa- 
rameterization proposed by Caprini, Lellouch, and Neu- 
bert (CLN) [10] 



/lAi(z) 

[1-8 



= /lAi(l) X 

j^z + (53p2 - 



(11) 



15)z^ - (23I/92 - 91)23 



where 



\/¥TT- ^/2 



(12) 



The maximum value of z over the entire allowed range 
of w is Zmax = 0-04. In an expansion in z{w) to 0{w— 1), 
is the coefficient of the linear term. In this case, p^ 



dh A 



dw 



at w 



1. 



can be referred to simply as the slope 
However, in higher-order expansions of z{w), the coef- 
ficient of {w — 1)^, called the "curvature," will take on 
non-zero values, as will coefficients of {w — 1)^ and other 
higher-order terms. Thus, even though only the single 
parameter p^ is used, the functional form of Eq.(ll) im- 
plies a non- linear function for hAi{w). 

An alternative parameterization motivated by HQS is 



hAi (w) 



1 



2p- 



(13) 



Le Yaouanc, Oliver and Raynal have demonstrated that 
this form obeys the bounds on the derivatives they obtain 
in the HQS limit using a rigorous, general approach based 
on Bjorken-like sum rules [12]. 

We do not use this form in our analysis, but note that 
in the p^ range of interest (~ 1 — 1.5) it agrees with CLN 
to a few percent. Thus, p^ obtained using this parame- 
terization would be virtually identical to that we obtain 
using Eq.(ll). 



D. 



Form-Factor Ratios 



As discussed in Sec. II C, for infinitely massive h and 
c quarks, HQS predicts i?i = i?2 = 1 exactly. However, 
these simple relations are broken for finite 6 and c quark 
masses, because these ratios are modified by both per- 

turbative (ag-dependent) and non-perturbative ^-^^^ 

corrections, where rrix represents either the h or c quark 
mass. 

Calculating higher-order loop corrections to the form 
factors yields expansions of the form [14]: 



Ri{w) = 1 
R2{w) = 1 
The coefficients 



022 1 



- 0J2 



A 



QCD 



^QCD 

nix 



(14) 
(15) 



bij of the Us terms are complicated ex- 
pressions of the D* boost w which have been calculated 
perturbatively up to second order, to an estimated accu- 
racy of about one percent (see [14]). Though they are 
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not explicitly suppressed by a heavy quark mass 

corrections, they are all functions that approach zero in 
the w^l HQS limit. 

The coefficients uji of the ( ^^2^22. ] factors are called 
"subleading Isgur-Wise functions." Subleading Isgur- 
Wise function correction terms are evaluated somewhat 
differently in various models in the HQET framework, re- 
sulting in a variety of predictions for Ri{w) and R2{w). 

Perturbative quantum chromodynamics (QCD) and a 
variety of other theoretical tools have been employed to 
determine the behavior both at and away from the w = 
1 endpoint. Close and Wambach used a simple quark 
model [15] to find 

Ri{w) = 1.15-0.07(w-l), (16) 
i?2(w) = 0.91 + 0.04(w-l). (17) 

Calculations with HQET have produced a variety of re- 
sults. An early prediction by Neubert was [2] 

Ri{w) = 1.35- 0.22(io - 1) + 0.09(w- 1)2, (18) 
R2iw) = 0.79 + 0.15(10- 1) -0.04(u'- 1)2. (19) 

More recently, CLN [10] used spectral functions, disper- 
sion relations, and HQS to predict 

Ri{w) ^ 1.27 -0.12(u;-l) + 0.05(^-1)2, ^^O) 
R2iw) = 0.80 + 0.11(ui-l)-0.06(w- 1)2. (21) 

Ligeti and Grinstein [14] using similar HQET calcula- 
tional tools find 

Ri{w) = 1.25-0.10(w-l), (22) 
i?2(w) = 0.81 + 0.09(w-l). (23) 

It can be seen that in all the predictions the coefficients 
of the (w—l) and (w— 1)2 terms are small; this is because 
-Ri and i?2 are, by construction, ratios that are expected 
to vary only slightly with w, whereas is not subject to 
such a restriction. For the predictions above, Ri and i?2 
vary by 0.07 or less over the full w range. For this reason, 
for our baseline fit, we follow precedent in treating i?i 
and i?2 as constants, independent of w. We will however 
examine deviations from this baseline fit in Sec. VII. 

III. THE BABAR DETECTOR 

The BABAR detector is described elsewhere in de- 
tail [16]. This analysis uses four of the five subdetec- 
tors of BABAR: the silicon vertex tracker, the drift cham- 
ber, a Cerenkov-light-based particle identification detec- 
tor, and the electromagnetic calorimeter. The analysis 
depends critically on the silicon vertex tracker to recon- 
struct the low-momentum pions produced by the decay 



D*~^ D^Tr~^, about two-thirds of which do not tra- 
verse more than the first quarter of the drift chamber 
(as is commonly done, we refer to these as "slow" pions, 
henceforth denoted "tTs"). 

IV. RECONSTRUCTION AND EVENT 
SELECTION 

We reconstruct the electron track in the drift chamber 
and silicon tracker and identify it using particle iden- 
tification (PID) information from dE/dx measured in 
the drift chamber, photons captured by the Cherenkov- 
light detector, and energy deposited in the electromag- 
netic calorimeter. The D* is reconstructed through its 
decay to a low momentum pion (tt^) and a Z)*^, and the 
D'^ through its decay to Ktt. The hadrons are selected 
by similar PID information to that used for the electron 
identification. 

We then choose final cuts that select B-^D*ev decay 
candidates, and from the four-momenta of the observed 
particles we determine the kinematic variables w, cosOg, 
cos 9vj and x- 

A. Backgrounds and Event Selection 

1. Background Categories 

We address each distinct source of background with 
appropriate cuts (which are further described below in 
Sec. IV A 2): 

1. Combinatorial background: events in which the re- 
constructed D* candidates that were not originally 
actual D* mesons. These events do not contribute 
to the peak in the Am = mj^^+^- — mxTr dis- 
tribution. Cutting on Am provides discrimination 
against this background. 

2. Peaking background, for which the D* decay has 
been correctly reconstructed and which contributes 
to the peak in the Am distribution. This category 
is further broken into two main sub-categories: 

(a) D** background: Events where a true D* is 
combined with a electron from the same B'^ or 
B~ parent, but an extra pion in the decay has 
been missed. This is primarily feed-down from 
P-wave D meson decays, but also includes 
non-resonant B-^D* euX decays, where X is 
nTT (n > 1). As shorthand we call this "£)** 
background" after its dominant component. 
Note though that the four P-wavc D meson 
states are usually referred to by the shorthand 
term "£)** modes" in the literature, and thus 
our redefinition for the purposes of this paper 
of "£)** background" differs slightly from the 
standard terminology. 
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(b) Other events with a true D*: 

• Fake electron : Events in which a real D* 
is combined with a hadron instead of an 
electron. This background is minimized 
by requiring electrons to pass the most 
stringent electron-identification criteria. 

• Cascades : Events with a decay chain of 
the form B° or B- ^ D*X, X ^ eY 
(X is e.i?., another D* or Z?°, Y is any- 
thing), so that the observed electron is 
not primary (i.e., not directly from a B 
decay) . Secondary electrons have a softer 
momentum spectrum than primary elec- 
trons so that a minimum electron mo- 
mentum cut is effective against this back- 
ground source. 

• Uncorrclatcd electron : Events where a 
D* is combined with a electron from the 
other B. Thus, one side has a. B or 
B- D*X, and the other has S° or 

Ye. A cut on cos 6'By, the co- 
sine of the angle between the B and the 
D* — e combination, is effective against 
this background. 

• Continuum : e+e"^ cc events for which 
one c quark forms a D* while the other 
hadronizes into a state which decays 
semileptonically to create an electron 
(cc D*eX). The other continuum 
backgrounds {e^ e~ ^uu,dd, ss) are neg- 
ligible and almost none of them pass the 
final cuts. Cuts on the D* momentum 
and event topology are effective at sup- 
pressing this background. 

2. Event Selection Cuts 

For event selection we use the procedure developed for 
the BABAR Vcb analysis [9] . The most salient cuts are as 
follows: 

• The momentum of the electron in the center-of- 
mass (CM.) frame, which is denoted throughout 
this paper as , is required to be larger than 
1.2GeV/c. This criterion selects B semileptonic 
decays and suppresses continuum (e'^e~—>cc) and 
cascade [B^D^e) backgrounds. 

• The slow pion from the D* decay must have a trans- 
verse momentum pt greater than 50 MeV/c. This 
rejects the mostly fake tracks found below this cut. 
The efficiency for finding true pion tracks below 
50 MeV/ c is small as the majority of the pions stop 
before leaving enough hits in the vertex detector to 
be reconstructed. 

• The probability of the fit of the D*e vertex, in- 
cluding the beam-spot contraint, must be greater 



than 1%. This suppresses inclusion of particles 
from the other _B, e.g., those tracks in uncorrelated 
electron background events. 

• To further suppress continuum background, we se- 
lect only candidates with | cos6'thrust| < 0.85, where 
6'thrust is the angle between the thrust axis of the 
D*e candidate and the thrust axis of the rest of the 
event. 

• The cosine of the angle 9 by between the direc- 
tion of the B and the direction of the D* — i sys- 
tem can be computed from the kinematics of the 
B°^D*iu decay (see Sec. IV B). Candidates 
with cos 6'By between —10 and -1-5 have been used 
to estimate background. We include only events 
that have |cos6'By| < 1.2 in the final sample. The 
cut is set beyond the physical limits at ±1 to allow 
for spillover due to resolution. 

• The final selection is based on Am = TO/firTTs — 
THK-n (the difference between the reconstructed D* 
and D° masses). We require 0.143 < Am < 0.148 
GcV/c2. 




8699A15 



FIG. 3: Reconstructed particle directions and cos 6by in the 
CM frame. The momentum vectors Pd* and are mea- 
sured. Their sum is Py. The magnitude of the B momen- 
tum, pB = |Ps|, and the angle 6 by the B makes with Py 
are known, but the azimuthal orientation 4)by of Ps around 
Py is not known. The points at (j>BY = 0, tt are in the D* — £ 
plane. 



B. Determination of Kinematic Variable w 

Lacking a measurement of the neutrino momentum, 
we do not have sufficient information to fully reconstruct 
the kinematic variables w, cos 9e, cos Oy, and x- However, 
using energy-momentum conservation and assuming that 
the missing particle is a massless neutrino, we have 
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= ml = Ml + Ml - 2EbEy + IpePv cosOby, (24) 

where py = po* + Pi is the four momentum of the com- 
bined D* and electron, = py is the mass squared and 
pY is the magnitude of the Y three-momentum. The B 
meson energy Eb and three- momentum magnitude pB 
are known from the energies of the colhding beam parti- 
cles, so we can solve for cos6'by: 



The response of the BABAR detector is modeled us- 
ing a GEANT4-based simulation. [17]. The simula- 
tion has been extensively validated by comparison with 
large data control samples (such as slow pions from 
generic D*^D'^tt decays for the slow pion helicity stud- 
ies). Event generation and particle decay are modeled 
using the package EvtGcn [18]. 

A. Signal 



cos 9 



BY 



Ml 



M'l 



2EbEy 



(25) 



Thus we can determine the angle between the B and 
the direction (Y = py / pY , where pV is the three- vector 
momentum of the Y) of the D* — £ system, but we do 
not know the azimuthal angle (j^BV- This is illustrated 
in Fig. 3 where it can be seen that the direction of the 
B must lie on the cone centered on Y with the opening 
angle 9by- 

For each possible ^sy we can compute the kinematic 
variables it;, cos0£, cos fly, and x. Since the angle (fiBV 
is not measured, we average over four points: two in the 
£'*-clcctron plane corresponding to the azimuthal angles 
(j)BY = and TT and two points out of the plane corre- 
sponding to the angles ±7r/2. Further, since BB produc- 
tion follows a sin^ 6b distribution in the angle between 
the B direction and the beam collision axis in the CM 
(T(4S)) frame, we weight the kinematic variables evalu- 
ated at each point by sin^ 0b- 

Fig. 4 illustrates the resolution achieved by this tech- 
nique. The core widths for each resolution distribution 
are small compared to the full range of each kinematic 
variable. The resolution is dominated by the average 
over the B direction; detector resolution makes a rela- 
tively minor contribution. The low-side tail on cos 9i can 
be attributed to final state radiation. 

The resolutions of the four kinematic variables are 
highly correlated. Thus, we rely on Monte Carlo sim- 
ulation to account for resolution effects. 

The distributions of the reconstructed kinematic vari- 
ables w, cos9(, cosOv, and x from Monte Carlo sim- 
ulation are displayed in Fig. 5. The shaded region is 
the distribution of the background as estimated from the 
Monte Carlo simulation using the method described in 
Sec. VI below. The background contributions to the dis- 
tributions are much smaller than the signal contribution 
(on the order of 10-15%). 



V. SIMULATION 



To simulate the signal we use Eq. (5) for the distribu- 
tion of the decay products. The MC samples are gener- 
ated with the default parameters Ri = 1.180, R2 = 0.720 
and = 0.920 [19]. The MC generator uses a simple lin- 
ear expansion (Eq. (10) taken to order {w — 1)), so we 
must reweight the MC events to model a more complex 
behavior such as that given by Eq. (11). 



B. Detection of Slow Pions 

Of particular importance to this analysis is the mod- 
eling of the efficiency for detecting low-momentum pi- 
ons. This task requires detailed simulation work since 
low momentum pions are lost through the interplay of 
acceptance, decay-in-flight, and stopping and scattering 
in the beam pipe and vertex detector. 

To test this modeling we examine a large control sam- 
ple of inclusively produced D*~^ mesons. Fig. 6 shows the 
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This analysis is dependent on Monte Carlo (MC) simu- 
lation to model the efficiency and the background distri- 
butions. The degree to which the simulation reproduces 
both the detector response and the underlying physics 
processes largely determines the systematic errors. 



FIG. 4: Monte Carlo assessment of the experimental reso- 
lution for the variables w, cosOt, cos9v, and x- For each 
variable the difference between reconstructed and generated 
values is shown. The resolutions are small compared to the 
kinematic ranges of the variables as shown in Fig. 5. 
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FIG. 5: One-dimensional projections from Monte Carlo sim- 
ulation of kinematic variables w, cos 9i, cos9v, and x for se- 
lected events. The blank region represents signal, and the 
shaded regions are estimated backgrounds; the histograms are 
a sum of both the signal and the estimated background be- 
neath them. 

distribution of cosine of the helicity angle for the decay 
D*+ D°7r"*" reconstructed in both the data and the 
MC sample in bins of _D*+ ccnter-of-mass momentum. 
The helicity is defined so that slower pions correspond to 
backward helicity angle (and is ^ —cosOy)- 

The MC distributions have been fit to the data using 
a simple weighting factor of the form 

/corr ^N{1+^, COS 6^^ + Q COS^ 0, J , (26) 

where the normalization N and the parameters and Q 
have been obtained for each momentum bin j. The three 
lowest momentum bins are most relevant to B—fD*iv 
decays. The agreement between the weighted simula- 
tion and the data is excellent. Note the weighting factor 
Eq.(26) is not applied to correct the signal. Rather the 
weight coefficients obtained by fitting this control sample 
measure the MC-data differences in each momentum bin. 

The linear terms, ^j, arise from inadequacies in the 
simulation of the detector response. The linear terms for 
the first three bins are small: 0.037±0.160, -0.023±0.024 
and — 0.016 ± 0.009, indicating good agreement between 
the data and Monte Carlo simulation efficiencies in the 
complicated arena of slow pion modeling. Note that crit- 
ical details of the distributions that arc unaffected by 
the weighting scheme (such as the evident thresholds in 
the low momentum bins) are reproduced correctly by the 
MC. 

The quadratic terms arise mostly from incorrect simu- 
lation of the polarization of the Z?*-mesons. The values 



FIG. 6: Events vs. the cosine of the slow pion angle relative to 
the line-of- flight of the D* , shown for bins of CM momentum 
of the D* . The momentum ranges for each plot, in GeV/c, are 
indicated in the upper left-hand corners. Data (points) and 
MC simulation (histogram) are shown for slow pions from the 
decay of inclusively produced D* mesons. The vertical scale 
is arbitrary, but corresponds to the number of background- 
subtracted reconstructed D* candidates in a given bin. Only 
the differences between data and MC points are relevant for 
our purposes. The errors are comparable to the dots rep- 
resenting the data. The simulation has been weighted by 
restricted functions to fit to the data as described in the text. 



of ( are large (^0.5 and 0.7) in the momentum bins for 
0.5 — 1.0 GeV/c and 1.0 — 1.5 GeV/c, where we expect 
£'*-mesons to be highly polarized products of few-body 
_B-meson decays. These terms do not indicate any prob- 
lem with detector simulation, and only affect a tiny frac- 
tion of the events in our final sample (the fake electron 
background) . 

Further, as the MC and data agree well for the linear 
terms, there is no need to make corrections to account 
for differences between simulation and data. This is fur- 
ther indicated by the lower right plot of pt,TTs shown in 
Fig. 12, which indicates excellent agreement between the 
slow pion transverse momentum distribution after the fi- 
nal fit; but with no linear weighting terms applied to the 
slow pion momentum. 

As the extraction of form factors (especially p^) is 
highly sensitive to the slow pion simulation we do add 
a contribution to the systematic error to account for the 
possible residual differences between the data and the 
MC simulation. We describe our method for doing this 
in Sec. IX A 3. 
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C. Final State Radiation 

The program PHOTOS [20] is used to model the effects 
of final state radiation (FSR). PHOTOS uses quantum 
electrodynamics (QED) to second order in aem (up to 
two FSR photons can be produced) and is known to pro- 
vide a quite accurate simulation of the FSR effects [21]. 



D. Other Semileptonic Decays 

A major source of background is other semileptonic 
B decays. Only the branching fractions for the decay 
modes Dlv, the signal mode D*£i/, and that for the 
mode B—>Di£v have been measured [22] (i.e., only one 
of the four P-wave D meson states) . The other branching 
fractions (including the three other P-wave D states and 
all non-resonant decays) used in the MC simulation are 
based on models with large theoretical uncertainties. We 
use fits to the data to constrain these branching fractions 
as shown in Sec. IX. to minimize their contribution to the 
systematic background error. 



VI. ANALYSIS METHOD 
A. Fitting 

Our approach to extracting the form factors is to per- 
form an unbinned maximum likelihood fit to the full 
four-dimensional distribution function (PDF) specified 
by Eq. (5). We parameterize the form factors in terms 
of the parameters Ri, i?2 and as described in Sec. II. 
The dependence is specified by Eq. (11). Since the- 
oretical predictions show only a mild dependence of i?i 
and i?2 on w (sec Eqs. (16-18), we first perform the fit 
treating them as constants over the entire range of w. 
We later show (in Sec. VII) how the results vary when 
the w-dependencies suggested by the Eqs. (16-18 are im- 
posed. 

In addition to assuming its form, we must account for 
the effects of resolution and efficiency on the measured 
distribution. We adopt approximations that enable us to 
carry out the maximum likelihood fit to the form factor 
parameters efficiently with minimal loss in precision. 



1. Resolution-Efficiency Correction Method 

To account for the efficiency and resolution effects we 
adapt the approach first employed in the angular analy- 
sis of the decay B ^ J/ i/jK* [23] (for more details of the 
technique see also [24]). The full PDF (J^) including res- 
olution and efficiency is given in terms of the theoretical 
PDF (F) of Eq. (5) by 



T{x]ii)= J dx e{x) G{x;x) F{x; fi), (27) 

where x represents the true variables (w, cos 9i, cos By, 
and x)j S are the observed values of the variables and 
^ represents the parameters (i?i, i?2 and p^) that deter- 
mine the form factors. The efficiency e{x) is the fraction 
of events with parameters a; that are detected and G{x; x) 
is the probability density that an event with true param- 
eters x is reconstructed with parameters x. 

The logarithm of the likelihood L that we need to max- 
imize is given by 

i 

where the integral 

I(/Lt) = J dxT{i; fi) (29) 

is required to normalize the likelihood in the presence of 
imperfect acceptance. The sum is over our data sample 
of iVdata events. 

While the distribution J-'{x, ^) can be simulated by 
Monte Carlo, it is not practical to use it directly as the 
function to be varied in the maximum likelihood analy- 
sis. Instead, we will introduce a method in which we can 
use Monte Carlo simulation data without any need to 
extract a detailed model of the efficiency and resolution 
function. 

We now try the approximation 

•?^(i;Ai) = /(5;m) = •^(i;Mmc)T^-^^^^^- (30) 

where is the parameter set used for generation of 
the Monte Carlo sample. If we had used the true values 
of the parameters fXt in place of pmc, then the maximum 
likelihood method would converge to the true values since 
by inspection the trial function /(i; fj,) would become the 
true distribution J-{x; fi). Of course the true values are 
not known. The use of fj^mc introduces a bias propor- 
tional to the difference between the Monte Carlo param- 
eter values and the true parameter values, which can be 
calculated explicitly in the limit of high statistics. 

When the approximation /(i; /i) of Eq. (30) is substi- 
tuted into the expression for the log-likelihood Eq. (28), 
it yields 

Ini = '^\nF{xi;fj,)-^\nF{xi;fi„^c) (31) 

i i 
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where X(/x,/ii„c) is the integral of the approximation of 
Eq.(30). 

Since terms that are independent of the fit parameters 
(constant terms) do not affect the point at which the 
maximum will be found, all the sums that depend only 
on /.imc can be dropped [i.e., the central two terms of 
the total sum) . The /i-dependent piece has been factored 
from these constant terms. We are left with a likeli- 
hood function that depends only on the theoretical PDF 
F and on the integral over the resolution and efficiency 
functions. 

The resolution and efhcicncy occur only through the 
normalization integral ^mc)- 

Using the technique of Monte Carlo integration to eval- 
uate the integral /imc) gives 



(32) 



■E 



F{xi , /^nic) 



The MC simulation generates events in proportion to 
T{x; ^mc) so the sum over Monte Carlo events approxi- 
mates the desired integral. The small error introduced by 
the Monte Carlo evaluation of the normalization integral 
is determined in Appendix A. 

In Appendix A, we demonstrate that the bias intro- 
duced by the use of the approximate form Eq.(30) is given 
by 



6 



(^mc — l^t)bMba 



where 



and 



Mba = JbcEc. 



(33) 



(34) 



a in Eq. (34) should be much less than one. Numeri- 
cal evaluation yields values of ~ 0.15 for Ri and i?2 and 
^ 0.03 for . If /inic — A't is comparable to the error, the 
residual "bias" should be small, and in practice is found 
to be so. 

To achieve /^mc ^ Mt we reweight the MC used in the 
efficiency integral computation to the fitted values of the 
parameters and iterate until the fit values converge. At 
this point /imc = M/itted and should deviate from the 
truth by an amount comparable to the error. In effect the 
residual deviation acts like a small increase (~ 2%) in the 
statistical uncertainty. The iteration method works well 
and converges quickly, as discussed further in Sec. VII C. 



2. Speeding Computation of Normalization Integral 
Through Moments Factorization 

Because the normalization integral I(/i, /^mc) depends 
explicitly on /j, it must be recomputed for every iteration 
of the procedure that maximizes the log-likelihood. To 
do this by Monte Carlo integration over the full decay 
phase space would be prohibitively slow. 

For the signal distribution, this integration can be 
avoided. Since the signal PDF can be written in the 
following form. 



F{i;^i) 



(37) 



i.e., as sum over a product of terms depending only on 
the fit parameters and terms depending only on the kine- 
matic variables, we can define moments Ma by 



Mr. 



(38) 



where the sum is over reconstructed MC events, i.e., the 
same sum that defines X(/i, /imc) in Eq. (32). This allows 
us to write X(/i,/imc) as a sum over moments: 



dln^dhiF 
dluFdlnF 



and where averages are defined by 





{A) 



J dxF{x, ^t)A{x) 
J dxF{x, //t) 



(36) 



The bias vanishes if the parameters used in the Monte 
Carlo coincide with the true values. Moreover, the bias 
vanishes insofar as the ratio !F /F of the smeared distribu- 
tion to the unsmeared distribution is independent of the 
parameters /t. Since T and F probe nearby regions, the 
derivative of their ratio will be small, so the coefficient 



X(/i, /i„ 



^A„(/i) X M„. 



(39) 



The moments can be computed once before fitting and 
then taken as input to the fit. Thus, in the fit, the time- 
consuming sum over weighted events is replaced with the 
sum over moments. Taking the expansion of Ha-^ to order 
{w — lY or z^, we have 42 moments to compute and sum. 



3. Background Subtraction Through Pseudo-Likelihood 

To handle the background, we would ordinarily add 
a PDF B{x) that models the background. The PDF F 
would be replaced with 



fF{i-ii) + {l- f)B{i), 



(40) 
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where / is the signal fraction. However, since we do 
not have a form for the background distribution before 
acceptance, we cannot achieve the factorization of the 
parameter dependence from the efficiency and resolution 
functions that leads to Eq. (39). To avoid this prob- 
lem we use the technique of subtracting Monte Carlo 
events representing the background directly in our likeli- 
hood sum rather than adding it to our PDF. We replace 
our log-likelihood function with the following 'pscudo- 
likehhood' [25] 



\nA = lnF(x«,;M)- (41) 

iGdata 

E ^bkgdlnJ^(xi^gd;/^) - iV.ignailnX(/x), 
jeMC 

where the first sum is over the data and the second is 
over a Monte Carlo sample representative of the back- 
ground. The weights W^l^gd account for any difference 
between the background in the data and in the Monte 
Carlo. They are computed in the manner indicated by 
Eqs. (42) and (44) in Sec. VI C below. The coefficient 
of the normalization integral is iVgignai ~ ^data — -^bkgd, 
where iVbkgd is equal to the number of events subtracted, 
accounting for the weights. It is easy to show that if the 
Monte Carlo simulation of the background is accurate, 
the procedure is unbiased. The statistical errors can be 
computed by a modification of the procedure used for an 
ordinary log-likelihood analysis. 



B. Methodological Error Contributions 

The approximations outlined above provide a fast and 
easily-implemented fitting procedure that uses the MC 
sample without any need to extract a detailed model of 
the efficiency and resolution functions. However, these 
advantages do not come without a cost: we must ac- 
count for the uncertainty introduced by the resolution- 
efficiency and pseudo- log-likelihood procedures. 

There are three other contributions to the error that 
are not accounted for in the fit. Two of these are 
Monte Carlo statistical in nature: the error from the 
pseudo-log-likelihood subtraction and the error from the 
Monte Carlo integration used to evaluate X(/i, /imc) in the 
resolution-efficiency correction procedure. There is also a 
contribution to the statistical error from the fluctuations 
in the background that are not accounted for in the fit. 
The first two can be reduced by increasing the size of the 
Monte Carlo sample, but the latter error is irreducible 
and must be included regardless of the size of the Monte 
Carlo sample. 

We have developed procedures for evaluating these 
three errors. The formulas used are collected in Appendix 
A. 



C. Background Level Estimation 

The categorization of backgrounds has been detailed in 
Sec. IV A. The combinatorial and peaking backgrounds 
are estimated by fitting the measured Am = m^^+^- — 
niKTr and cos^sy distributions in data. The Am fit 
determines the combinatorial background. The cos 9by 
distribution is then fit to a combination of signal, combi- 
natorial background, "£)**" peaking and other peaking 
background using the binned likelihood fitting method 
of Barlow and Beeston [26]. The shape of the cos6'By- 
distributions are taken from the Monte Carlo simulation. 
In the fit, only the signal fraction and the portion of 
the peaking background due to D** and D*Xev decays 
are free parameters. The combinatorial fraction is set to 
the value obtained from the Am fit and the other com- 
ponents of the peaking background are scaled from the 
Monte Carlo. See Sec. IV A 1 for more details. 

We apply the above procedure first to obtain the over- 
all fractions for the combinatorial (/comb) and the D** 
{Jd") backgrounds. These are obtained by fitting to 
the full data sample. The fraction of the consolidated 
remaining peaking backgrounds (/othcrPk), detailed in 
Sec. IV A 1, is obtained by direct scaling from the MC 
simulation. 

The weight needed to subtract the correct amount of 
each background type is given by 

ata / Ac\\ 

VVtypo = T- , I4^j 

JVtype 

where type=com6, D** or other depending on the partic- 
ular background type being subtracted and A'typo is the 
number of Monte Carlo events of that type available. 

The determination of /comb and /d** is discussed in 
more detail in the next two subsections. 



1. Determination of Level of Combinatorial Background 

The combinatorial background is due to D* candidates 
that were not originally actual D* mesons. It includes 
candidates where the D is properly reconstructed, but 
is paired with a random tt; events in which the tTs is 
in fact from the decay D* DtTs, but the D is not 
properly reconstructed; and candidates which are purely 
combinatoric (neither the D nor the tt^ are correctly re- 
constructed) - i.e., where any one or more of the three 
detected particles does not originate from the decay chain 
D*^Dns^{KTT)TT,. 

To model this background we use the following func- 
tional form 

(43) 

where below the D* threshold Amthr we take /(Am)=0. 
This is an extension of the commonly used threshold 
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function [9]. To the single scale parameter s the power 
K and the factor (^ ^^^^ ^ have been added as extra 
degrees of freedom. This extension allows us to obtain 
a better fit to the background and allows the fit enough 
freedom to account for the uncertainty in the background 
shape. Fixing k = 1 and 7 = corresponds to the usual 
unextended threshold function (further details of the ne- 
cessity of this extension are given in Appendix B). 

The signal distribution is fit by three free Gaussian 
functions plus a fixed tail of two wide Gaussians. The tail 
is fixed by fitting pure signal Monte Carlo as described 
in Appendix B. 

The final fit to data is shown in Fig. 7. 



background fractions, are free in the fit. The combina- 
torial background is input from the Am fit. The other 
peaking backgrounds are scaled from the Monte Carlo 
simulation as described above. 
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FIG. 8: The cos Oby data distribution and tlie result of the 
fit with the estimated signal and background contributions, 
as described in the text. Shading indicates the various com- 
ponents in the fit. 

For input to the form-factor fit we need the background 
fractions within the cos by signal window (|cos0By| < 
1.2). These are /comb = 5.33 ± 0.26% for the combina- 
torial, fD- = 4.85 ± 0.35% for the D** and /othcrPk = 
7.03±0.45% for peaking backgrounds other than the D**. 



FIG. 7: The fit (solid line) to the Am distribution for D*ev 
candidates as described in the text. The distribution is shown 
logarithmically to emphasize details of the functional fit, but 
the actual level of background is more clearly seen in the linear 
data plot shown in Fig. 16. 



2. Determination of Level of D** Background 

To estimate the peaking background we use a mixture 
of Monte Carlo predictions and a fit to the cos 9 by dis- 
tribution within the D* signal window Am = 0.143 — 
-0.148 GeV/c^. We fit the cos 6'by distribution for the 
signal and the background due to decays of the type 
B D*+Xev (called the "D**" background, as defined 
in Sec. IV A 1). The shapes of the cosOby distributions 
for the signal and the backgrounds are obtained from the 
Monte Carlo simulation. The backgrounds other than 
the D** are fixed at values obtained by scaling from the 
Monte Carlo by the appropriate luminosity ratios. In 
the case of fake electrons (which are mostly misidentified 
pions), we also scale using the misidentification proba- 
bilities obtained from the data control sample of pions 
(specifically from the BABAR h^tt^h~Vt- dataset). 

Figure 8 shows the results of this fit to the full data 
sample. The shading indicates the background source 
or signal. Only two paramaters, the signal and the D** 



3. Dependence of Background Levels on Kinematic 
Variables 

In addition to obtaining overall background fractions, 
we perform our Am-cos 9 by fitting procedure in five bins 
for each of the kinematic variables in the data. This al- 
lows extraction of the dependence of combinatorial and 
D** backgrounds for each kinematic variable. Wc com- 
pare the ratio of fitted background yields in data and 
MC, and parameterize the difference as a linear function 
of the kinematic variables. This allows application of a 
correction term to the weight for each Monte Carlo event 
of the form: 

IF-p- = (l + a^(u;-(^«),ypJ) (44) 
x{l + acosee{cos9e ~ {cos9e)^ypJ) 
x(l -I- {cos9v — (cos 6'y)jypj,)) 

x(l + ax(x- (X)typc))> 

where the means for each type (which arc calculated from 
the Monte Carlo simulation distributions) are subtracted 
to keep the normalization independent of the slope. 
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To extract the dependence of combinatorial back- 
ground on the kinematic variables, the procedure of fit- 
ting four Gaussians for the signal (discussed in Appendix 
B) and the extended threshold function for the combina- 
torial background portion is repeated for five bins in each 
of the four kinematic variables. 

To obtain the slopes needed in Eq. (44) we form the 
ratio of the background estimate in each bin to the num- 
ber found in that bin in the Monte Carlo simulation and 
fit it to a straight line. All of the fits are displayed in in 
Fig. 19 Appendix B. 

This information is summarized and shown in Fig. 9, 
and the numerical results for the slopes are given in Ta- 
ble I along with the means, which are needed to do nor- 
malization independent weighting, as discussed above af- 
ter Eq. (44). 
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FIG. 9: Ratio of data to Monte Carlo events for combinatorial 
background for the four kinematic variables w, cos Be, cos By, 
and X, obtained from 5-bin fits to data and Monte Carlo in 
each kinematic variable. The lines are result of linear fits (see 
text). 



TABLE I: The fitted slopes to the distributions shown in 
Fig. 9 of the ratio of data to Monte Carlo for the combi- 
natorial background in the four kinematic variables w, cos 9e, 
cosBv and x, and the mean values of the Monte Carlo distri- 
butions for these variables. 



Variable 


Slope 


Mean 


w 


0.51 ±0.50 


1.23 


cos 9i 


0.05 ±0.11 


0.31 


cos 9v 


-0.20 ±0.09 


0.00 


X 


-0.13 ±0.06 


1.48 



As in the combinatorial case, we repeat the extraction 
of the D** component of the peaking backgrounds in five 
bins in each of the four kinematic variables. The results 
of all these cos fits are shown in Fig. 20 in Appendix 
B. This is summarized in the linear fit for data to Monte 
Carlo events ratio for the D** background (the compo- 



nent free in the fit), which is shown in Fig. 10, with the 
numerical results in Table II. 

The linear fits in Fig. 10 give the slopes used in weight- 
ing the backgrounds using Eq. (44). 

We use the central slope values to reweiglit our back- 
ground subtraction and propagate the errors into the sys- 
tematic uncertainty by varying the slopes within their 
errors. 
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FIG. 10: Ratio of data to Monte Carlo events for D** back- 
ground for the four kinematic variables w, cos Be, cos By, and 
X, obtained from 5-bin fits to data and Monte Carlo in each 
kinematic variable. The lines are result of linear fits (see text) 



TABLE 11: The fitted slopes to the D** distributions shown 
in Fig. 10 of the ratio of data to Monte Carlo for the combi- 
natorial background in the four kinematic variables w, cos Bi, 
cos By and x, and the mean values of the Monte Carlo distri- 
butions for these variables. 



Variable 


Slope 


Mean 


w 


-1.35 ±0.52 


1.28 


cos Bi 


0.14 ±0.29 


0.35 


cos By 


0.18 ±0.12 


-0.11 


X 


0.07 ±0.08 


1.53 



Further details of the fits and estimates for the back- 
ground fractions and weights are given in Appendix B. 

D. X Averaging 

Few data events arising from true signal will be found 
at values of the kinematic variables for which the PDF 
goes to zero. However, there is no such limitation on the 
number of data events arising from backgrounds that may 
fall at these points. Because of this, when the logarithm 
of the PDF is evaluated for these points as part of the 
background-subtracted computation of the log- likelihood 
it develops spuriously large (negative) values, driving the 
fit far astray from the actual minima. 
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To deal with these spurious zeros we deliberately intro- 
duce a small amount of artificial resolution that prevents 
the PDF from going to zero. This resolution is chosen to 
be small compared to the natural resolution of our kine- 
matic variable reconstruction (see Fig. 4) and so has little 
effect on the sensitivity of our measurement. Any small 
bias introduced is removed by the resolution-efficiency 
correction method in the same manner as that due to 
naturally occurring resolution. 

We implement this artificial resolution by evaluating 
the PDF for all data points (including the data, Monte 
Carlo-selected background, and integration samples, for 
consistency) at values of the phase space not exactly at 
the given point, but offset from it on both sides in x by 
±0.1. The kinematic variable x is chosen because it is 
at particular discrete, parameter-dependent values of x 
that the theoretical PDF is zero. We then take the values 
of the PDF for these two offset points and average them. 
This average is used as the value for the given point in 
evaluating its contribution to the likelihood. 

Testing on Monte Carlo simulation shows x-averaging 
does not bias the fit values, nor increase the errors. Vary- 
ing the value by which we move x &n^d that ±0.1 is 
small enough to not bias the fit values, and large enough 
to eliminate the spurious zero problems fully, so we eval- 
uate all fit results with this offset. 



VR1-R2 = -0.77, 

VR^-p- = +0.60, (46) 
Vr2-p^ = -0.76. 

As we do not yet have enough statistical sensitivity to 
fit for the w-dependence of Ri{w) and R2{w), we con- 
sider instead the effect of the theoretically predicted de- 
pendence on the result. Parameterizing this dependence 
as follows 

Ri{w) ^ Ri + ai{w - I) + I3i{w - If , (47) 

R2{w) ^ R2 + a2{w - I) + hiw - If , (48) 

and inserting these w-dependent forms into the PDF 
(with fixed ai , Pi from the theoretical predictions) and 
fitting for the constant terms Ri and R2 we find the 
results given in Table III. 

These theoretical variations in R\{w) and R2{w) yield 
slightly larger values for i?i and slightly smaller values 
for i?2. However, this is presumably only because they 
all decrease the mean of -Ri {w) and increase the mean of 
R2(w) when averaged over the w spectrum. 



VII. RESULTS 
A. Free Parameter and Baseline Fits 

We perform the baseline fit (see Sec. TIC) taking Ri 
and i?2 to be independent of w and use the single param- 
eter form factor description of Ref. [10] given in Eq. (11). 

We find 



Ri = 1.396 ±0.060 ±0.035, 
i?2 = 0.885 ±0.040 ± 0.022, (45) 
p2 = 1.145 ± 0.059 ±0.030, 

where the first uncertainties given are due to the limita- 
tions of the data statistics and the second to limitations 
of statistics of the Monte Carlo simulations. Systematic 
uncertainties are discussed in Sec. IX below. The errors 
are highly correlated. The error matrix for the full sta- 
tistical error for i?i , i?2 , and (including Monte Carlo 
statistical) is: 

0.00479 -0.00243 0.00277 
-0.00243 0.00207 -0.00231 
0.00277 -0.00231 0.00447 



The correlations are 



B. Higher-Order Free Parameter Fits 

We performed a study using a large MC sample in 
order to ascertain whether it was also useful for us to do 
free fits to Ha^ with up to five parameters (see Eq. (10)) 
for a sample generated with a particular form (the CLN 
form). To this end, we created a MC sample of a million 
events generated with the ansatz of Eq. (11) with no 
acceptance cuts or resolution smearing applied. We then 
attempted to fit this using four parameters ( i?i,i?2, and 
coefficients to O {w ~ if in Hai) and five parameters ( 
i?i,i?2, and coefficients to O {w — if in /iai)- The fit 
results for the slope and curvature (for the four parameter 
fit) and slope, curvature, and cubic term coefficient (for 
the five parameter fit) were far from the generated values 
corresponding to the (if — 1)" expansion of the input 
(Eq. 11). 

This problem arises because the fit is attempting to 
compensate for the missing higher-order terms. As a re- 
sult the coefficients oi (w — l) or {w — lf returned by the 
fit cannot be cleanly interpreted in terms of the theoreti- 
cal expectations or interpreted in terms of limits such as 
those given in [12, 27]. 

Thus, while either the CLN or free polynomial expan- 
sion are a priori valid, the former is preferred from the- 
oretical constraints, and we see from the above that it 
is not useful to attempt to fit a sample with a free ex- 
pansion to a reasonably low expansion order and then 
interpret the result in terms of CLN coefficients. 
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Reference 


Oil pi 


a2 


a 

P2 


rCl 




'2 

P 




Baseline 


0.0 0.0 


0.0 


0.0 


1.40 


0.89 


1.15 


Caprini- 


-Lellouch-Neubert [10] 


-0.12 0.05 


0.11 


-0.06 


1.42 


0.87 


1.12 




;eti-Grinstein[14] 


-0.10 0.0 


0.09 


0.0 


1.42 


0.87 


1.11 




Neubert[2] 


-0.22 0.09 


0.15 


-0.04 


1.45 


0.86 


1.09 



TABLE III: Dependence of form-factor parameters on theoretical assumptions about slope (a) and curvature of R\ and R2 
w-dependence. See Eqs. (47) and (48). 



C. Iterative Convergence Studies 

We have studied the convergence of the likelihood max- 
imization procedure described in Sec. VI. A. Varying the 
initial seed values by as much as 50% of the final values 
still leads to convergence to the same final values within 
0.0005 for each parameter within five iterations. 

VIII. GOODNESS-OF-FIT 

To assess whether the results of the fit reproduce the 
distribution of the kinematic variables in the data, we use 
a binned method of estimating goodness-of-fit. Cher- 
noff and Lehmann [28] have shown that while a binned 

will in general have a wider than expected distribution 
when the free parameters are optimized using a likelihood 
fit, the effect is small for a large number of bins. There- 
fore we adopt the binned as our primary goodncss-of- 
fit test. 

Since we do not have an explicit form for the 
acceptance-corrected PDF {T) of the four reconstructed 
variables w, cosOg, cos0y and x, we reweight the MC 
sample to construct the distributions expected from our 
measured parameters. That is, the contribution of the 
signal to a bin is given by the MC event sum 

^signal = E ^iSnaP (49) 

ieMC 

where 

W'iijnal = f signal X iVdata X (50) 

and in this case = is the weight needed to 

r \ X i , flrnc ) 

modify the distributions from those generated with fi^c 
(i?i = 1.18, i?2 = 0.72, and = 0.92), to those obtained 
from this analysis and /signal = -/otherPk-/comb 
is the signal fraction. 

For the background we use the same weighting proce- 
dure (see Eqs. (42) and (44)) used in the fit. Using these 
weighting procedures the normalizations of the data and 
reweighted distributions match by construction. 

We consider two types of goodncss-of-fit: 

• A four-dimensional binned based on a total of 
1296 bins, that is, 6 x 6 x 6 x 6 bins, six each for 
the four kinematic variables 



• One-dimensional projected distributions of w, 
cos 9 1, cos9v, and x, as well as the distributions 
of the CM electron momentum (p|) and the trans- 
verse momentum (pt ) of the slow pion from the D* 
decay. 

A. Four-dimensional Binned 

Six bins were chosen for the x^ to make sure we have 
adequate statistics in each bin while still having sensitiv- 
ity to the shape predicted by Eq. (5) combined with de- 
tector and event selection acceptance. The kinematically 
empty regions shown for Monte Carlo events in Fig. 11a 
for w > 1.08 and backward cosO^ (due to the electron 
momentum cut) and in Fig. lib for small w and for- 
ward cos 9v (due to slow pion acceptance) are excluded 
by simple cuts on minimum cos Bg and maximum cos 0v 
as functions of w, respectively. 

The x^ is given by 

where the sum is over the 1296 bins, rii is the number of 
data events in bin z, rrii is the weighted number of Monte 
Carlo events and CTdif is the error on the difference ni—rrii. 
There is, of course, an error on our estimate of the error 
(Tdif too, that makes the x^ contributions from low stat- 
ictics bins vary widely. Thus, estimating ijdif is critical 
to obtaining a reasonably behaved x^ distribution. 

Using a simple Monte Carlo simulation divided into 
1296 bins that roughly mimics the number of low statis- 
tics bins in our dataset, we can achieve a fiat x^ probabil- 
ity distribution if we base the error on the best estimate 
of the number of events in each bin computed using the 
overall data-to-Monte-Carlo events ratio . That is rrii 
is given by muliplying the number of MC events n™'^ by 

^mc ■ 

Specifically, we use n\'^^^ — fru + (1 — f)mi where 
/ = rmc/(l + ?'mc)- The difference error (Tdif is obtained 
by using yri^^ for the contribution from the data and 
by scaling the error on rrii by Vn^°"Vm,. Thus, we use 
the hypothesis we are testing (that the weighted Monte 
Carlo follows the same distribution as the data) to ob- 
tain the best possible estimate of (Tdif by using the best 
estimators we can obtain for the number of data events 
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^j^best^ and for the number of MC events {n^^^^ /r^c) to 
estimate their respective contributions to the error. This 
procedure reduces the number of bins where the error es- 
timate fluctuates low leading to artificially high con- 
tributions. 

For the baseline fit wc find 



X 



1336.66 



(52) 



for 1292 degrees-of- freedom (1296 bins minus three free 
parameters and one for normalization), which yields a 
probability of 19% indicating the fitted parameters yield 
a good description of the full four-dimensional distribu- 
tion of the data (note that we keep two decimal significant 
digits in since every one unit change in correponds to 
a full one sigma shift in the fit parameter). 



B. One-dimensional Binned Projection Plots 

For a more detailed examination of how well wc fit 
the data, we turn to one-dimensional projection plots. 
Fig. 12 shows the weighted Monte Carlo (histogram) 
overlaid on the background-subtracted data. The back- 
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FIG. 11: Plots showing empty regions (due primarily to cuts) 
in the phase space of the kinematic variables w, cos 9i, and 
cos6'v using the Monte Carlo sample: (a) shows cosOe vs. 
w, where the high w, low cosOg region is removed in the 
goodness-of-fit calculation; (b) shows cos^y vs. w, the high 
cos6'v, low w region is removed in the goodness-of-fit calcu- 
lation. 



grounds arc subtracted in the one-dimensional analog to 
the method used in the fit, as detailed in Sec. VIC. 

The difference divided by its error ( "pull" ) is shown be- 
low each plot. The agreement is very good. The and 
the corresponding probability for each variable is sum- 
marized in Tabic IV. We take the number of degrees-of- 
frccdom (ridof) to be the number of bins minus one for 
purposes of estimating the probabilities. 

To test our sensitivity to the interference terms in 
Eq. (5) we make the same kind of comparisons in six bins 
of the angle x for the variables cosOi and cosOy- Since 
the normalization is not fixed in these plots the number 
of degrccs-of- freedom is 20. The cosd^ plots are shown 
in Fig. 13. The cos^y plots are in Fig. 14. In Table V 
we collect the and its probability for each plot. Again 
the agreement is excellent indicating that the fit succeeds 
well in reproducing the details of the distribution. 

In these plots we can also see the effect of isolating 
specific portions of the PDF - e.g., in the very low x 
region, cosx^l and wc obtain the modulation effect of 
the sin^ycos^y in the fifth term of the PDF, seen in 
the top left plot in Fig. 14. On the other hand, in the 
high X region, cosx-^ — 1 and this effect flips sign due 
to the sixth term, as can be clearly seen in the bottom 
right plot. (These types of effects also occur in Fig. 13 
but they are masked by the effect of the cut, which 
suppresses the low cos^f region strongly in that case, so 
the modulations are more difficult to observe). Much of 
the information content of our data is encoded in the 
interference terms, so it is important to ensure that we 
reproduce their effect accurately. 



TABLE IV: and x^-probability for kinematic variable pro- 
jections and electron momentum. The number of bins in these 
histograms is either 20 or 16. 

variable x''/^dot probability 



w 


22.0/19 


28% 


cos 6 1 


23.0/19 


24% 


cos 6v 


31.8/19 


3.3% 


X 


13.0/19 


84% 


* 

Vii 


17.3/19 


57% 


Pt 


9.0/15 


88% 



IX. SYSTEMATIC STUDIES 

The systematic uncertainties on the three parameters 
Ri, i?2 and of the baseline fit (see Sec. II C) arc sum- 
marized in Table VII. The systematic errors are compa- 
rable for fits which include variation in w of Ri and R2, 
such as those shown in Sec. X. 

A major source of systematic error arises from the MC 
simulation, specifically, from the modeling of the detector 
resolution and efficiency. We arc particularly concerned 
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TABLE V: and x^-probability for cos 9t and cos 9v overlay 
plots in six bins of the kinematic variable x- These values 
correspond to the numerical evaluation of one-dimensional x^ 
of Figs. 13 and 14. 

variable x cut X^ I^Aoi X^ probability 



cos Qi 


0<X<| 


18.4/20 


56% 


cos Qi 


f <x<f 


19.3/20 


50% 


cos Qi 


„6 X i 6 


29.6/20 


7.7% 


cos Qi 


6 — X 2: 6 
6 — X i 6 


17.9/20 


59% 


cos Qt 


23.9/20 


25% 


cos Qi 


if <x<^ 


12.4/20 


90% 



cos t^v 
COS Qv 
cos dv 

COS dv 
COS 
COS 6y 



0<X<| 

f <X<f 

,6 - X i 6 

,6 - X i 6 

6 — X i 6 



19.5/20 
26.7/20 
10.8/20 
20.1/20 
27.4/20 
25.3/20 



49% 
14% 
95% 
45% 
12% 
19% 



with the efficiency for reconstructing the low-momentum 
charged pion from decays. 

Further, how well the background event generation is 
modeled, e.g. how close the branching fractions in the 
event generator are to measured ones, affects the dis- 
tribution of background we subtract. The kinematic- 
variable dependence of the backgrounds, however, is not 
taken from the Monte Carlo but is measured in the data 
under the assumption that the difference between the 
Monte Carlo and the data can be represented by a lin- 
ear kinematic-variable dependence. The uncertainty in 
this measurement also contributes to the systematic er- 
rors. We are also able to check our background estimates 
using the goodness-of-fit as shown in Section VIII. 

Our systematic errors fall broadly into two categories: 

• A: Detector simulation, i.e., the accuracy with 
which the Monte Carlo reproduces the resolution 
and efficiency of the detector. We are most sensi- 
tive to how well it models the complex process of 
detecting and measuring low- momentum pions. 

• B: Simulation of B decays and background, i.e., the 
accuracy with which our event generation models 
the signal and background distributions and how 
well we correct for differences between data and 
Monte Carlo. 



A. Detector Simulation 

Extensive studies of the simulation of the detector 
response, including careful examination of track recon- 
struction and particle identification efficiencies, have 
been performed using selected data control samples. Ad- 
justments for known simulation deficiencies are used in 
investigating and evaluating the systematic errors. Form- 
factor measurements are insensitive to overall normaliza- 
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FIG. 12: Background-subtracted data (points) overlaid on 
Monte Carlo (histograms) for all four kinematic variable dis- 
tributions and for the electron momentum (p|) and transverse 
momentum (pt) of the slow pion for our best fit. In the bot- 
tom panel of each figure is shown the pull (difference over 
error) plot. The red line in the pull plot is not a fit, but is the 
line at zero shown for comparison (similarly for all following 
pull plots). 



tion errors. Thus differences in the efficiencies that are 
independent of the fit variables do not affect the results, 
but MC/data differences that vary as a function of these 
variables are of concern. 

To assess the uncertainties due to differences in shape 
rather than normalization (Sec. IX A 2-Sec.IXB 1), we 
vary the efficiency corrections, reweight the Monte Carlo 
samples with these modified corrections, and rerun the 
fit to the kinematic variable data. Finally, we take the 
difference between the results of these fits and those ob- 
tained with the nominal Monte Carlo simulation as an 
estimate of the systematic error on the parameters. This 
procedure is repeated for each source of detector-based 
systematic uncertainty. The individual uncertainties are 
added in quadrature to obtain the total systematic error. 



1. Charged Particle Identification (PID) 

Using data and Monte Carlo simulated control samples 
we have studied the difference in particle identification ef- 
ficiency Epid between data and Monte Carlo. We calculate 



22 



a correction factor, /^^d ' ^ ^ function of mo- 

mentum. For electrons the correction factors vary from 
0.991 to 1.008 over the momentum range from 1.2 GeV/c 
to 2.5GeV/c. We assess the impact of the uncertainty 
in these corrections by approximating their momentum 
dependence by hnear functions and varying the size of 
the small slope of these functions by its one-sigma uncer- 
tainty. The observed deviations from the default fit arc 
ARi = 0.0064, Ai?2 = 0.0052 and Ap^ = -0.0016 for 
the increased slope and -0.0032, -0.0031, -f 0.0009 for 
the decreased slope. We take half of the difference as the 
systematic error from this source. Since the momentum 
dependence is not a monotonic function, this procedure 
slightly overestimates the uncertainty. 

For kaon identification wc employ the same procedure. 
The observed variations arc significantly smaller. 

The probability of misidentifying the charged hadrons 
TT*, K^, p^, as electrons is small, less than 0.2% in the 



momentum range 1.2 — 2.5GcV/c. Since a variation of 
the peaking background by 9% results in a very mod- 
est change in the fit results, and since the fraction of 
this background originating from hadrons misidentificd 
as electrons is small, we conclude that the uncertainty in 
the hadron misidentification rate is negligible. 

The misidentification rate of pions as kaons as a func- 
tion of pion momentum ranges from a few tenths of a 
percent to almost 5%. However, pion misidentification is 
well simulated by the Monte Carlo and thus should have 
little impact on the fit results. Furthermore, the main 
consequence of pion misidentification is to increase the 
combinatorial background. Since we estimate the com- 
binatorial background from a fit to the measured Am 
distribution, we are not dependent on the Monte Carlo 
to assess the size of this background. Wc conclude that 
the uncertainty in the pion misidentification rate has lit- 
tle impact on the fit results. 
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FIG. 13; Background-subtracted data (points) overlaid on 
Monte Carlo (histograms) and pull plots for cos9e for six x 
cuts. 



FIG. 14; Background-subtracted data (points) overlaid on 
Monte Carlo (histograms) and pull plots for cos^v' for six 
X bins. These plots illustrate our sensitivity to interference 
effects. 
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2. Charged Particle Tracking 

The difference between data and Monte Carlo for the 
tracking efficiency for electrons, charged kaons and pions 
decreases roughly linearly as a function of momentum. 
Analogously to the method described for the PID error 
of Sec. IX A 1, we vary this linear dependence on the 
particle momentum in the Monte Carlo simulation and 
take the small deviation as the error from this source. It 
is a small contribution to the total systematic error. 



3. Slow Pion Reconstruction 

The efficiency for reconstructing the low-momentum 
charged pion (tTs) from the decay D*^~*D'^Trs is a ma- 
jor source of systematic error. Because of the small en- 
ergy release in D* decays, this pion is emitted in ap- 
proximately the same direction as the parent D* and its 
momentum in reconstructed events varies from approxi- 
mately 50 to 400 MeV/c in the laboratory frame, peaking 
at ~ 100 MeV/ c, and with a long tail extending out to 400 
MeV/c. Since w = Eo- /rno- in the B rest frame, the Hs 
momentum is correlated with w and thus its momentum- 
dependent efficiency impacts especially the measurement 

0fp2. 

The uncertainty due to the low-momentum tracking 
efficiency is evaluated differently from other tracking er- 
rors because such low-momentum tracks do not traverse 
the whole drift chamber. Their detection and measure- 
ment depends primarily on the silicon vertex tracker. To 
study this efficiency as a function of we use a large 
sample of D*'^ D'^'^t decays selected from all events 
and measure the distributions of the helicity angle of 
(6*^^) in the _D*+ rest frame as a function of the 
momentum. 

We parameterize the tt^ efficiency as function of its 
momentum using the form 

£{P-^J = Emax ( 1 - . ,r.(^ V ) (53) 

with pq being the threshold momentum and f3 controlling 
the rapidity with which the efficiency rises above thresh- 
old. For Ptj^ < po we set e to zero. We fit the data and 
Monte Carlo helicity angle distributions for f3, po and the 
coefficient of the cos^ 6*77^ term (allowed because the D* 
is in general polarized to varying degrees in each momen- 
tum bin, see Sec. VB) to obtain efficiency functions for 
the data and the Monte Carlo. The helicity method only 
determines the relative momentum dependence of the ef- 
ficiency. The absolute dependency is not relevant for this 
analysis. 

To assess the systematic uncertainty due to the tTs effi- 
ciency {eipTTg)) we weight the Monte Carlo simulation by 
the ratio of the data function to the Monte Carlo func- 
tion and assign the observed shifts in the fitted values 
for Ri, i?2 and as systematic errors. As expected, 



is the most sensitive (Ap^ = 0.018) to £{ptts) since, of 
all the kinematic variables, e(p7rs) most strongly affects 
the shape of the w-distribution, which is the kinematic 
variable that has the strongest impact on the extracted 
value. 

B. Event Simulation 

1. Final State Radiation 

Final state radiation, primarily from electrons in the 
decay chain, lowers the momenta and to a lesser de- 
gree changes the angles of detected particles. Though a 
physics effect, final-state radiation acts much like a res- 
olution ~ it smears the kinematic variables. We simulate 
the emission spectrum of radiative photons using PHO- 
TOS [20], so the resolution-efficiency correction proce- 
dure properly corrects for final-state radiation to the ex- 
tent that PHOTOS models it correctly. 

To test the sensitivity to uncertainty in the simulation 
of final-state radiation we evaluate the shifts in the fit- 
ted values of Ri, R2 and p^ between fits done with and 
without final-state radiation corrections. We assume an 
uncertainty of 30% in the simulated photon emission and 
thus take ~ 1/3 of the observed shifts (final-state radi- 
ation on vs. final-state radiation off) of 0.0129, 0.0067 
and 0.0039 as an estimate of the systematic uncertainty. 

2. Peaking Background Mixture Uncertainty 

The modeling of the peaking background depends on 
the knowledge of the branching fractions for the mixture 
of scmilcptonic B decay modes that make up this back- 
ground. These branching ratios and the form factors for 
these modes are not very well measured. To estimate the 
uncertainty associated with these branching fractions, we 
use a one-sigma variation of the procedure discussed in 
Section VIC, viz.: wc first fit the cos 6'_By distributions 
with a sum of functions where the signal fraction and the 
contribution due to the D** background are allowed to 
float. This fit is done in five equal-sized bins for each 
kinematic variable, over the full kinematically allowed 
range of the variable, in both data and Monte Carlo. We 
then take data-to-Monte Carlo ratios of the fit results 
per bin (shown in Fig. (10)), which we then fit to a linear 
function of the kinematic variables. While for the central 
value we use this function to weight the D** background 
itself before it is subtracted from the data, for the er- 
ror we vary the weighting function up and down by its 
one-sigma slope uncertainty as shown in Table II. The 
difference of the original fit minus the result of this fit 
gives us an estimate of the error due to the uncertainty 
in the shape of the D** background for each kinematic 
variable. We add the uncertainties for each kinematic 
variable in quadrature to give us the total error due to 
this source. 
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Another technique of estimating the D** background 
error is to vary the branching fractions of the P-wave D 
meson and non-resonant mode components while keep- 
ing the overall peaking background fixed. The results of 
using this technique come out to be of a similar scale to 
using the above method, which relies on fits to data. Us- 
ing both techniques would double-count the error, so we 
rely on the data-fit technique alone to estimate the error 
from this source. 



3. Combinatorial Background Shape Uncertainty 

A procedure similar to that employed for the peak- 
ing background shape is used to estimate the impact of 
the uncertainty on the shape of the combinatorial back- 
ground, except in this case we work with the Am fits 
rather than the cos 9by fits. As before, we do this fit 
in five equal-sized bins for each kinematic variable over 
the full kincmatically allowed range of the variable, in 
both data and Monte Carlo. We then take data-to-Monte 
Carlo ratios of the fit results per bin, and find the results 
shown in Fig. (9), which we then can approximate with 
linear fits. We use these fit results with their associated 
one-sigma slope uncertainties shown in Table I to weight 
the combinatorial background fraction up and down and 
fit to the data minus this rcweightcd distribution. The 
difference from the original fits gives us an estimate of 
the error due to the shape of combinatorial background 
uncertainty for each kinematic variable. Adding the un- 
certainties for each kinematic variable in quadrature gives 
us the total error from this source. 



4- Background Check with Goodness-of-Fit 

The quality of the fit as measured by the binned 
(see Sec. VIII) is sensitive to the background fraction as- 
sumed. Thus, we can use to check our estimate of 
the backgrounds. Figure 15 shows plots of versus the 
fractions of the combinatorial, D**, and other peaking 
backgrounds. These scans yield estimates of the back- 
ground fractions (in percent) of 

/comb = 5.4 ±1.3, 

6.1 ± 1.2, (54) 
/othorPk = 6.4 ± 0.9, 

which are in good agreement with the values of 5.33%, 
4.85% and 7.03% obtained from the Am and cos Oby fits 
in section VI C. 

The good agreement indicates not only that the sizes 
of these backgrounds are well-estimated, but that the 
shape in the four-dimensional kinematic-variable phase- 
space given by the Monte Carlo agrees well with their 
shape in the data. 



1340 
1338 




1336 r 



0.03 0.05 0.07 

fcomb 



1341 
1339 
1337 




1335 fa- 

0.04 0.06 0.08 




1332 '- 

0.03 0.05 0.07 0.09 

7-2005 ^OtherPk 
S699A20 

FIG. 15: of fit vs. fractions of combinatorial (/comb), 
D** [fo") and other peaking backgrounds (/othcrPk). From 
this we see that our method of ascertaining the goodness-of- 
fit is independently indicating that the values we obtained for 
these fractions from the cos 9by and Am fits were correct. 



5. Total Peaking and Combinatorial Background Fractions 

Another source of error is the uncertainty in the nor- 
malization of the peaking and combinatorial background 
fractions. To estimate this error we vary the fractions 
/comb, fo" and /othorPk by their uncertainties and as- 
sess the impact on the fit parameters. The uncertainties 
for the combinatorial and D** fractions are taken from 
the Am and cos9by fits. However, since /othorPk is in- 
put to these fits by scaling the Monte Carlo distribution, 
we take its uncertainty instead from the goodness-of-fit 
scan of the previous section. In varying /othorPk we keep 
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the total peaking /peaking = fo" + /othorPk fixed, since 
this behavior is observed when /othciPk is varied in the 
cos 9 BY fit. 



6. MC/Data Sideband Comparison 

The distributions of the kinematic variables for Monte 
Carlo and data agree well in the Am sideband region used 
to estimate the combinatorial background. But since the 
combinatorial background comprises about a third of the 
total background under the peak, the small differences in 
the shapes of the distributions could introduce an error in 
the background subtraction process. To estimate the im- 
pact of this effect, we first take the data-to-Montc Carlo 
ratios in the sideband region and then fit these ratios 
with polynomials. 

We next use these functions one at a time to multiply 
the combinatorial background from Monte Carlo before 
it is subtracted from the data to prepare the sample for 
fitting. We then carry through the fits and take the dif- 
ferences in the form factors we obtain from these with the 
form factors we obtained from the fits with the unaltered 
background. This procedure yields the results shown in 
Table VI. 

The differences are small. The largest is from using the 
function for w, from which we find Ap^ ~ 0.006. Since 
in the end we take the w-dependence of the combina- 
torial background from the data and the deviations due 
to weighting the angular distributions are small, we add 
nothing to the systematic uncertainty from this check. 



TABLE VI: Changes in the fitted parameters for reweighting 
of tlie Monte Carlo combinatorial background distributions in 
the four kinematic variables. 



Reweighted distributions 


Ri 


i?2 


•2 

p 


w distribution 


-0.002 


0.0 


0.006 


cosOe distribution 


0.001 


-0.002 


-0.001 


cosOv distribution 


0.002 


-0.003 


0.001 


X distribution 


0.004 


-0.002 


0.001 



C. Summary of Systematic Errors 

The systematic errors are summarized in Table VII. 
The largest contributions to the systematic errors of Ri 
and i?2 arise from the uncertainties in background com- 
position, and in the normalization of non-D** peaking 
backgrounds. On the other hand, the largest contribu- 
tions to the systematic error of arise from the uncer- 
tainties in the kinematic variable-dependence of the back- 
grounds, in the slow pion tracking efficiency and also in 
the normalization of non-D** peaking backgrounds. 



While the total error remains statistics-dominated, this 
result is only about a factor of two above the system- 
atic limit. An improved understanding of the systemat- 
ics would be needed if we were to extend this analysis to 
other D decay modes and a larger data set. 

TABLE VII: Summary of the estimated systematic errors for 
the baseline fit. 



Error source '^Ri '^R2 ^ ^ 

Charged particle track efficiency 0.005 0.004 0.003 

Slow pion track efficiency 0.003 0.000 0.018 

PID misID (electron, kaon) 0.006 0.004 0.003 

D" background normalization 0.003 0.002 0.005 

Other peaking backgrounds normalization 0.021 0.006 0.015 

Combinatorial background normalization 0.002 0.001 0.003 

Bkgd composition (branching fractions) 0.016 0.008 0.005 

Kinematic Variable-dependence of bkgd 0.001 0.001 0.028 

Final state radiation 0.005 0.002 0.002 

Total Systematic 0.027 0.013 0.035 



X. SUMMARY AND DISCUSSION 

Based on a sample of 16,386 i?" D*~e^Vf, events, 
we have measured the form factors Ai, V and A2 in 
terms of the HQET-inspired parameters R2 and p^. 
The baseline result, using the CLN parameterization of 
/lAi (w) (see Sec. II C) is obtained neglecting any possi- 
ble w-dependence of Ri {w) and R2 (w) , and including all 
errors is 

Ri = 1.396 ± 0.060 ± 0.035 ± 0.027, 

i?2 = 0.885 ±0.040 ±0.022 ±0.013, (55) 

p2 = 1.145 ± 0.059 ± 0.030 ± 0.035, 

where the first error is statistical, the second Monte Carlo 
statistical, and the third systematic. 

Note that the CLEO result discussed in Sec. I was ob- 
tained using the expansion in {w — 1) (see Eq. (10)) to 
linear order. When analyzed in this manner, the current 
dataset yields i?i = 1.40 ± 0.06, R2 = 0.87 ± 0.04, and 

~ 0.79 ± 0.06. CLEO also obtained using the form 
(see Eq.(13)) suggested by reference [12] that is very close 
numerically to the CLN parameterization we use. In this 
case they obtained ~ 1.42 ± 0.32 which is consistent 
with our result. 

The values of p^ obtained are very different using the 
linear parameterization, as mentioned in Sec. I. This 
is because the linear fit significantly underestimates the 
slope as it attempts to accommodate the higher-order 
terms {i.e., at w = 1, the absolute value of the slope is 
larger assuming a curve for the fit vs. a line). 



26 



The of this hnear fit is 1337.98 which is 1.32 units 
of worse than the basehne fit using the expansion of 
Eq. (11) (see Eq.( 52)). Thus while the basehne fit is 
shghtly better than the hnear, both are acceptable. 

The sensitivity needed to independently establish the 
w-dependence of Ri{w) and i?2(w) is not yet available, 
but theoretical predictions may be used to make com- 
parisons. If we compare the predictions of CLN [10] for 
Ri{w) and R2{w) (see Table III) to the baseline result, 
the shifts are found to be Ai?i = 1.42 - 1.27 = 0.15 
(« 2.0CT apart) and Ai?2 = 0.87 - 0.80 = 0.07 (« 1.5cr 
apart). For Ligeti and Grinstein [14] the numbers are 
Aiii = 1.42-1.25 = 0.17 (2.3cr) and Ai?2 = 0.87-0.81 = 
0.06 (w 1.3cr). The older prediction of Neubert [2] is 
closer with Ai?i giving « 0.6(t. If the theoretical error 
on i?i is ^ 0.03 as estimated by Ligeti [29] then there is 
a mild indication of disagreement. Higher statistics will 
be needed to resolve or confirm the possible discrepancy. 

These results allow a five-fold reduction in the largest 
source of systematic error in Vch measurements based on 
B D*lv decays to be made. Using the measurements 
of Ri and i?2 presented in this paper, we update the 
resuh from the 2004 BABAR paper [9] to find 

I Kb! = 37.6 ± 0.3(stai) ± 1.3(syst) t\%{theory) x 10"^ 

where small correlations between the present analysis and 
that of Ref. [9] have been ignored. The error due to the 
uncertainties in i?i and i?2 is reduced from ^2 6% *° 
±0.5%. The overall systematic error drops from ±1.7 to 
±1.3. The other systematic errors remain the same as 
published in Ref. [9]. 

A considerable improvement was also obtained in mea- 
surements of the lepton endpoint spectrum in b^udv de- 
cays. Compared to usage of the old CLEO i?— > D* form- 
factor measurements, usage of these newer form factors 
allowed the systematic error on the inclusive B^X^iv 
branching fraction for decays with a lepton in the mo- 
mentum range 2.0 — 2.6GeV/c to be reduced from 6.7% 
to 2.4%. In the higher momentum range 2.3 — 2.6 GcV/c 
the improvement was from 2.8% to 1.3%. This enabled 
a significant improvement in the measurement of [Kibj 
from the endpoint spectrum[30]. 

In addition we have demonstrated useful approxima- 
tions to the maximum likelihood method that allow us 
to cope with the limited size of the Monte Carlo sam- 
ples available to modern high- luminosity experiments. 
We have also developed the procedures needed to eval- 
uate the corrections and additional uncertainties due to 
these approximations. These methods are not unique to 
B-^D*£v or B^J/tpK* decays, but could be applied 
to any analysis with a complex multi-dimensional accep- 
tance and resolution functions. 

An approach to multi-dimensional goodness-of-fit cri- 
teria that allows assessment of the quality of a likeli- 
hood fit has also been developed. A binned measure, 
with the method developed to estimate the errors, yields 
a measure of goodncss-of-fit that has a straightforward 



statistical interpretation which is easily and intuitively 
understandable. 
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Appendix A: Evaluation of additional errors 

We supply here some of the mathematical details asso- 
ciated with biases and uncertainties introduced in our 
application of the maximum likelihood method. See 
Rcf. [24] for further details. 

Except as noted, in the below, we use the notation of 
Sec. VI. 



1. Bias From Approximate PDF 



approximation for the unknown !F{x] fi) in Eq. (31)). As 
the number of events N tends to infinity, the sum defining 
the likelihood can be replaced by the integral 



\nL = N J (ia;JF(x; /it) ln/(a;; /i) — A^ln J dxf{x]fi). 

(57) 

The second term comes from normalizing the PDF. If 
we expand about /Xt and keep only the leading terms, 
treating 



Suppose events are distributed as 



(56) 



but the unnormalized PDF used in the maximum like- 
lihood fit is f{x;fj.) (defined by Eq.(30) and used as an 



T{x;nt) 



/(a;; /it) 



J dxJ^{x; fit) jdxf{x;nt) 



(58) 



as being itself of order /i — /it , we find that the maximum 
occurs at the point where the derivatives given by 



S/i 



■ InL : 



dfj, \jdxT{x;^t) j dxf{x; fit) 



dlnf \ /d\nf 



are zero. All derivatives are evaluated at /i = /it. 
When we take the approximation 



f{x;fi) = J^(x;/inic) 



F{x;fi) 

F{x; /imc) 



and expand about fi = fi^, wc find to leading order 



91n/\ fdlnf 



dfj, 



i^i~^lt) (59) 



(60) 



J"(x;/it) 



/(a;; /it) 



T{x;fit) 



J dxJ^{x; fit) J d.xf{x; fit) J dxJ^{x;fit) 
Setting the derivative of the log-likelihood to zero, we have 



9 In ^ 5 In ^ 



dfj, 



(Mn 



Mtj 



-(^-Mt) 



dfi 
ainF 



ainF 

dfjL 



9/i 



<91nF 



d}nf\ 
Q, ) 

dliiF 
dfi 



dfi 



(/imc - /it)- 



ainF 
dfi 



d\nF 



dfi 



1 T 



(61) 



(62) 



Solving for the bias fi — fit yields the result Eq.(34). 

2. Error from Monte Carlo evaluation of 
normalization 

The error induced by the use of Monte Carlo integra- 
tion to normalize a PDF can be calculated considering 
the asymptotic form 

lnL = N J dxfix;fit)lnf{x;fi)-N\n[J f{x-fi)dx+e{fi)] 

(63) 



where now we are no longer concerned with the distinc- 
tion between T and /. The function e(/i) is the deviation 
of the Monte Carlo integral from the true integral. The 
maximization equation now reads 
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91n/\ / d\nf 




so, again setting the log of the likehhood to zero, the bias is 



(64) 



where we recognize that the term multiplying (fi ^ fit) in Eq.(64) is the inverse of the error matrix E. 

Now we suppose the Monte Carlo simulation to be accurate so there is no bias, but an uncertainty is introduced 
by the fluctuations in e. We therefore need an estimate of the matrix 



d e(pi) \ f d e{fi) 



dfJ.a J dyf{y;ii)J \dii J dy' f{y'; n) 



(66) 



A straightforward calculation gives 



1 



1 



[jdxff 



dx 
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dxf 
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dxf 



dl 
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dx 
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dx 



dfx 



dxf 



(67) 



To apply this to our circumstance where we perform a Monte Carlo integration with points having the density 
jF(x; //mc), we need to make the identifications 



Nmc ^ 



1 



/ dxT 



dxT{x\ flri 



f 



F{x-fi) 
F{x;fin^c)' 



(68) 



In the final expressions, we do not need to make the distinctions between /i, /imc, and fit which is only necessary when 
taking the derivatives with respect to fi. These differences arc of higher order. This results in the correspondences 



dx 



dxf 
df 



dfi 



where the derivatives are evaluated at fit 
In this way we find 
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(71) 
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(72) 



(73) 



where Mmc — Nmc I ^ data is the ratio between the num- data (the inverse of the i\^c used in Sec. VIII). That is to 
ber of MC events and the number of signal events in the 
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say, the error matrix due to Monte Carlo integration has where 
the same form as that for the statistical error from data, 
except that it is 1 /Nmc instead of 1 /A'data that enters. 



3. Pseudo-log-likelihood Error 

The pseudo-log-likelihood error can be computed from 
sums over the Monte Carlo sample used in the back- 
ground subtraction. In this case the weights {wi) are 
those used to weight each type of background to obtain 
the correct normalization as described in Sec. VI C (see 
Eq. (42)). 

We consider a pseudo-log-likelihood 



In/l 



E 

t(S) 



j(B) 

\nFsixj'; ^) - Ns / dxFs{x;n) 



' E 

j'{B,MC) 



(74) 



where the 5", B, and {B,MC) indicate that the sums 
are over signal events, background events, and back- 
ground MC events, respectively. Of course we cannot 
know event-by-event which are signal and which are back- 
ground events, but for the sum this doesn't matter. The 
signal distribution is indicated by Fs{x;ii); the back- 
ground is Fb{x). We expand about fi = fit and set 
{d/diia)^nA to zero to find the background subtraction 
error Es is given by 



NsEs\fi-fit) = 



T- 



5h\Fs{xi]ii) 
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S In Fs {xi : /.t) = In Fg {xi ; ^) - 
and 



/ dxFsix; Ht)lnFs{x; fi) 
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(76) 
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(77) 



where 



(A)g^ J dxFsix;nt)A{x)/ J dxFs{x; fit) (78) 

The fluctuations average to zero, but their squares do 
not. For example. 
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where now 

{A)b = j dxFB{x;^it)Aix)/ / dxFs^x- tit)- (81) 
Altogether we find 



(pi - pit) (pi - Ait) J = J^^s 



Appendix B: Background estimation 



(82) 



This appendix gives more detail on the background 
estimation procedures. 



1. Combinatorial background 

Figure 16 shows the distribution of Am for the data 
sample on a linear scale. The linear scale is used so the 
level of combinatorial background can be judged by eye 
from the tail extending outwards towards the right (the 
peaking backgrounds are underneath the signal peak here 
and their levels cannot be distinguished from the data). 
The combinatorial background appears to be of order 5%, 
but it is hidden under a signal that has substantial tails. 
It is difficult to distinguish tail from background. To 
extract the background fraction and estimate the signal 
this distribution is fit to a set of Gaussians represent- 
ing the signal and to a threshold function to model the 
background. To investigate the background shape and 
the tails of the signal Monte Carlo simulated events are 
used. 



i 3000 
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0.15 0.155 0.16 

Am (GeV/c2) 



FIG. 16: The Am data distribution for D*eu candidates on 
a linear scale. 

First, to investigate the shape needed to describe the 
background, a pure combinatorial background sample se- 
lected from the Monte Carlo simulation is used. 



Figure 17 shows the distribution of Monte Carlo com- 
binatorial background events fit to the (a) unextended 
(b) and extended threshold function, as discussed in 
Sec. VICl. Though not immediately obvious by eye, 
the unextended function systematically underestimates 
the data in the signal region near Am = 0.1454 GeV/c^. 
The true number in the signal box (0.143 < Am < 
0.148 GeV/c2) is 3540. The unextended fit yields 3400±39 
whereas the extended fit yields the much closer result 
3528 ± 52. Fits to subsets of the Monte Carlo data {e.g., 
those binned in the kinematic variables) show the same 
pattern with the fit to the extended function having much 
better success at estimating the number of background 
events than the unextended. 

Next, Figure 18a shows the Am distribution of a Monte 
Carlo simulated pure D* signal sample. There are sub- 
stantial tails and even the core is not consistent with a 
single Gaussian. It takes five Gaussians to achieve the 
good fit shown in Figure 18a. 

The fit to the full MC Am distribution is finally shown 
in Figure 18b. In this fit the three core Gaussians of the 
signal are allowed to float, but the tail is flxed. For the 
threshold function, the threshold and the scale factor are 
fixed while the other two parameters are fitted. This 
procedure reproduces the Monte Carlo input signal and 
background fractions. 

The Am distribution of the data with the fit as de- 



cb 



1325 ±65 



scribed is shown in Figure 7. We find 
which corresponds to a combinatorial background frac- 
tion of 6.4 ± 0.3% before cutting on cos^^y. 

To check the stability of this fit, the fitting condi- 
tions are varied. Letting the fourth Gaussian float yields 
1374 ± 92. The error on the difference is ±65, so this 
is consistent with the fixed fourth Gaussian used for the 
central value. Moving the long tail represented by the 
fifth Gaussian up and down by 50% results in a variation 
of w ±26 in the background estimate. This variation is 
added in quadrature with the error reported by the fit to 
yield a total error of ±70. The combinatorial background 
fraction with error becomes 6.39 ± 0.32%. 



2. Fits in kinematic variable bins 

To estimate the difference in kinematic- variable depen- 
dence between data and MC, bin- by-bin fits in five bins 
for each of the four kinematic variables are done. In the 
five-bin fits for the combinatorial-background kinematic- 
variable dependence (as discussed in Sec. VICl) it is 
found that only four Gaussians are needed to represent 
the signal shape and that the ±50% variation of the tail 
Gaussian produces negligible changes in the estimated 
background level. The A771 plots with fits are shown in 
Figure 19. The data-to-Monte Carlo ratio plots which 
result from these fits is shown Fig. 9 in Section VI. 
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FIG. 17: The Am distribution for combinatorial background selected from the Monte Carlo with fit to the unextended threshold 
function (a) and to the extended function. 
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FIG. 18: The Am distribution for (a) pure D* signal selected from the Monte Carlo with fit to five Gaussians and (b) the Am 
distributions on the full Monte Carlo sample, fit to the five Gaussian signal shape plus extended threshold function. 
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FIG. 19: The Am distribution for data in kinematic variable bins. The rows are fits for w, cos6e, cosOy and x in each of the 
five bins spanning the full range of the variable. 



3. Peaking background including D** background 

fits. 

Similarly, cos 9 by fits in five bins for each of the four 
kinematic variables are done to extract the shape of the 



D** background. The results of each fit are shown in 
Figure 20. The data-to-Monte Carlo ratio used to weight 
the background obtained from these fits are shown in 
Figure 10 in Section VI. 
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FIG. 20: The cos 6by distributions fit to background plus signal in five bins in each of the kinematic variables. Each row shows 
the plots for the five bins corresponding to the variables w, cosOi, cos6v and x- The shading is the same as that in Figure 10. 



